A Machine Learning Approach to Measurement of Text Readability for EFL Learners Using Various Linguistic Features
نویسندگان
چکیده
The present paper introduces and evaluates a readability measurement method designed for learners of EFL (English as a foreign language). The proposed readability measurement method (a regression model) estimates the text readability based on linguistic features, such as lexical, syntactic and discourse features. Text readability refers to the comprehension rate of a text (0.0-1.0). The experimental results showed that the proposed readability measurement method yielded higher accuracy than a baseline method, which provides the mode value of the distribution of the comprehension rate data as the estimated value for any input.
منابع مشابه
Computational Linguistics & Reading Comprehension Text Complexity: Implications for the EFL Teacher
This paper reports on a 1-year longitudinal study that aimed at investigating the relationship between intermediate EFL learners’ development of reading and writing proficiency while delineating a range of linguistic features present in a set of 30 reading comprehension texts and 350 written essays. Data (reading comprehension exam scores and written essays) were collected from a junior high sc...
متن کاملLinguistic Features of English Textese and Digitalk of Iranian EFL Students
This study aimed at investigating the English textese of Iranian EFL learners by scrutinizing the linguistic features through a qualitative design. In doing so, 700 messages were collected from 43 MA Iranian EFL learners of both genders. The features were categorized and analyzed calculating the frequency and percentage. The findings of the study showed that Iranian EFL students used different ...
متن کاملRule-based and machine learning approaches for second language sentence-level readability
We present approaches for the identification of sentences understandable by second language learners of Swedish, which can be used in automatically generated exercises based on corpora. In this work we merged methods and knowledge from machine learning-based readability research, from rule-based studies of Good Dictionary Examples and from second language learning syllabuses. The proposed selec...
متن کاملQualitative and Quantitative Examination of Text Type Readabilities: A Comparative Analysis
This study compared 2 main approaches to readability assessment. Thequantitative approach applied idea density based on part of speech tagging andcompared 3 sets of text types (i.e., narrative, expository, and argumentative) withrespect to their ease of reading. The qualitative approach was done throughdeveloping questionnaires measuring intermediate EFL learners’ perceptions oncontent, motivat...
متن کاملEmotion Detection in Persian Text; A Machine Learning Model
This study aimed to develop a computational model for recognition of emotion in Persian text as a supervised machine learning problem. We considered Pluthchik emotion model as supervised learning criteria and Support Vector Machine (SVM) as baseline classifier. We also used NRC lexicon and contextual features as training data and components of the model. One hundred selected texts including pol...
متن کامل